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Abstract 


Global healthcare systems are challenged by the COVID-19 pandemic. There is a need to opti- 
mize allocation of treatment and resources in intensive care, as clinically established risk 
assessments such as SOFA and APACHE II scores show only limited performance for predict- 
ing the survival of severely ill COVID-19 patients. Additional tools are also needed to monitor 
treatment, including experimental therapies in clinical trials. Comprehensively capturing 

human physiology, we speculated that proteomics in combination with new data-driven analy- 
sis strategies could produce a new generation of prognostic discriminators. We studied two 
independent cohorts of patients with severe COVID-19 who required intensive care and inva- 
sive mechanical ventilation. SOFA score, Charlson comorbidity index, and APACHE II score 
showed limited performance in predicting the COVID-19 outcome. Instead, the quantification 
of 321 plasma protein groups at 349 timepoints in 50 critically ill patients receiving invasive 
mechanical ventilation revealed 14 proteins that showed trajectories different between survi- 
vors and non-survivors. A predictor trained on proteomic measurements obtained at the first 
time point at maximum treatment level (i.e. WHO grade 7), which was weeks before the out- 
come, achieved accurate classification of survivors (AUROC 0.81). We tested the established 
predictor on an independent validation cohort (AUROC 1.0). The majority of proteins with high 
relevance in the prediction model belong to the coagulation system and complement cascade. 
Our study demonstrates that plasma proteomics can give rise to prognostic predictors substan- 
tially outperforming current prognostic markers in intensive care. 


Author summary 


Healthcare systems around the world are struggling to accommodate high numbers of the 
most severely ill patients with COVID-19. Moreover, the pandemic creates a pressing need 
to accelerate clinical trials investigating potential new therapeutics. While various biomarkers 
can discriminate and predict the future course of disease for patients of different disease 
severity, prognosis remains difficult for patient groups with similar disease severity, e.g. 
patients requiring intensive care. Established risk assessments in intensive care medicine 
such as the SOFA or APACHE II show only limited reliability in predicting future disease 
outcomes for COVID-19. In this study we hypothesized that the plasma proteome, which 
reflects the complete set of proteins that are expressed by an organism and are present in the 
blood, and which is known to comprehensively capture the host response to COVID-19, can 
be leveraged to allow for prediction of survival in the most critically ill patients with COVID- 
19. Here, we found 14 proteins, which over time changed in opposite directions for patients 
who survive compared to patients who do not survive on intensive care. Using a machine 
learning model which combines the measurements of multiple proteins, we were able to 
accurately predict survival in critically ill patients with COVID-19 from single blood samples, 
weeks before the outcome, substantially outperforming established risk predictors. 
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Introduction 


The COVID-19 pandemic has brought health systems around the globe to the brink of col- 
lapse. Capacities for intensive care treatment of patients with organ failure have reached their 
limits in many regions with intense SARS-CoV-2 transmission and were often central to politi- 
cal decisions regarding restrictions on public life, e.g. through contact restrictions or lock- 
downs. The global impact of the pandemic increases the pressures to devise new clinical 
approval strategies so that potential therapeutics can be identified and tested faster, at higher 
accuracy, and in clinical trials with smaller sample sizes [1]. Various models for classification 
of disease severity and for prediction of clinical trajectories and outcome have been developed 
for COVID-19, based on laboratory measurements, clinical scores, imaging, and omics tech- 
nologies [2-5]. These pointed to the importance of specific immune cells, inflammatory and 
antiviral cytokines and chemokines, as well as the coagulation cascade in COVID-19 disease 
progression [5-13]. They predict the risk of the future need for mechanical ventilation in the 
heterogeneous group of patients at early time points, e.g. at admission to the hospital, when 
clinical parameters and biomarkers differ substantially between mildly affected and severely ill 
patients [2-5,14]. 

Treatment decisions within the most severely ill patients, for instance whether a patient 
should be treated with extracorporeal membrane oxygenation (ECMO), have a major 
impact on resources. Currently, such decisions are often based primarily on the patient’s 
age, comorbidities, and established intensive care prognosis models, such as the Sequential 
Organ Failure Assessment (SOFA) or Acute Physiology and Chronic Health Evaluation 
(APACHE I), which assess the patient on the basis of a combination of established clinical 
and laboratory risk parameters [15,16]. However, the predictive values of both SOFA and 
APACHE II for the most critical forms of COVID-19 are limited [17-19], creating a diag- 
nostic gap and imminent need for reliable predictors, specifically validated in severely ill 
COVID-19 patients, to guide and tailor efforts in treating these critically ill patients. More- 
over, the lack of reliable predictors increases the challenge of interpreting the results of 
early phase clinical trials, which typically enroll low numbers of patients. Indeed, testing 
for the success of a clinical intervention requires classification of divergent clinical trajecto- 
ries within more homogeneous groups, such as WHO grade 7 patients. At least in COVID- 
19, this is hampered by the fact that molecular signatures within a group of patients with 
comparable disease severity are considerably more similar when compared to the differ- 
ences between mild and severe patients [6,7,14]. 

Plasma proteomics holds the promise of integrating the genetic background of an individ- 
ual with their life history, physiological, nutritional, and demographic parameters, and hence, 
have the potential to form the foundation of a new generation of predictors [20-24]. Among 
the spectrum of proteomic technologies available, mass spectrometry has the appeal that once 
markers are identified, they allow for the direct generation of targeted panel assays measurable 
by selective reaction monitoring (SRM), simplifying their implementation into clinical rou- 
tine. Recently, new mass spectrometry based proteomic technologies have been developed to 
increase throughput and measurement precision, so that the path from discovery to applica- 
tion is simplified [6,25-28]. 

We studied proteomes of two well characterized cohorts of the most severely ill patients 
with COVID-19 in two independent health care centers (Charité-Universitaétsmedizin Ber- 
lin, Germany, and Medical University of Innsbruck, Austria) who gave informed consent 
to deep clinical and molecular phenotyping [14,19,29]. Using a recently published dataset 
from our group [14], we specifically assessed whether proteomic measurements can be 
used to predict the outcome (death vs. survival) of severe COVID-19 from time series data, 
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as well as from samples taken at key clinical decision points. We found 14 protein concen- 
tration trajectories that, over the timeline of disease progression, distinguish survivors 
from non-survivors. Moreover, a machine learning (ML) model, based on parenclitic net- 
works, generated accurate prognosis on single time point samples that were collected once 
the patient reached the maximum treatment level. Emphasizing the prognostic potential of 
the proteome, this sample was, in median, taken 39 days before outcome. The ML predictor 
trained on these samples substantially outperformed established clinical risk scores and 
predicted the outcome among a group of severely ill patients with similar clinical presenta- 
tion with high accuracy. 


Results 


The exploratory cohort used for marker identification and model generation consisted of the 
50 most severely ill COVID-19 patients out of a cohort of 168 patients with varying disease 
severity, treated between 15 March and 16 September 2020 at Charité University Hospital, Ber- 
lin, Germany, a tertiary care referral centre for the treatment of ARDS with associated weaning 
centre (Fig 1A) [14,19,29]. There were no treatment restrictions due to shortages of intensive 
care capacity at the time of this patient cohort. The 50 patients selected for the study were 
treated in intensive care with invasive mechanical ventilation plus additional organ support 
such as renal replacement therapy (RRT), ECMO, or vasopressors, corresponding to grade 7 
on the WHO Ordinal Scale for Clinical Improvement. Patients with limitations of therapy 
according to their wish were excluded. Thirty-six (72%) patients required RRT, 19 (38%) 
patients were treated with ECMO, and 16 (32%) patients were treated with both RRT and 
ECMO. Fifteen (30%) patients died. Median time of hospitalization in survivors was 63 days 
(n = 35, IQR 44-89). Median time from admission to death was 28 days (n = 15, IQR 16-43). 
Patient characteristics are shown in S1 Table. The details on the proteomic workflow, protein 
detection rates, as well as patient trajectories are provided in S1 and S2 Figs of our previous 
work [14]. 

Within this treatment group of critically ill COVID19 patients, the Charlson Comor- 
bidity Index [30,31] performed poorly in classifying survivors from non-survivors by 
AUROC values of 0.63 (P = 0.16, Fig 2A). From a time-resolved data resource for the 
PA-COVID-19 study, spanning over a compendium of clinical parameters, plasma prote- 
omes, cell counts, enzyme activities, and outcomes [14], we further determined the SOFA 
and APACHE II scores. These scores, too, could not confidently distinguish survivors 
from non-survivors (Fig 2A, AUROC = 0.68, P = 0.05 for APACHE I] score at ICU admis- 
sion, and AUROC = 0.65, P = 0.11 for SOFA score at the time of first sampling at WHO 
grade 7). 

Studying the plasma proteomes [14] we found 78 proteins for which the concentration 
changed significantly during the patients’ disease course. Out of these proteins, 14 were 
found to change differently over time for survivors and non-survivors (Fig 1B, Fig 1C). 
Patients with fatal outcomes were characterized by a significant increase in inflammatory 
proteins over time (SAA1, SAA2, CRP, ITIH3, LRG1, SERPINA1, SERPINA10 and LBP). 
Conversely, the levels of these proteins in plasma decreased over time in survivors. More- 
over, anti-inflammatory proteins (SERPINA4, A2M) decreased over time in non-survivors, 
indicating a persistent pro-inflammatory signature. Similarly, two key proteins of the coag- 
ulation system, thrombin (F2) and plasma kallikrein (KLKB1), known to be decreased in 
severe COVID-19 [12,14], further decreased over time in non-survivors, while increasing 
in survivors. 
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Fig 1. Protein concentration trajectories that differentiate survivors of critical COVID-19 from non-survivors. a) Fifty 
Patients with PCR-confirmed COVID-19 treated at Charité University Hospital Berlin, Germany, were sampled 
longitudinally, to generate high-resolution time series for 321 protein quantities. In parallel, precise clinical phenotyping was 
performed, including recording of intensive care and disease severity scores, treatment parameters, and outcome 
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(PA-COVID-19 data resource [14]). b) Protein level trajectories over time (FDR < 0.05), for which time-dependent 
concentration changes (y-axis: log2 fold change) during the peak of the disease differentiate survivors from non-survivors in 
critically ill patients (Methods). ¢) as b) but expressed as boxplots (log2 fold change last vs first day). Figure created with 
BioRender.com. 


https://doi.org/10.1371/journal.pdig.0000007.g001 


For diagnostic purposes and treatment decisions time series data is however impractical to 
obtain. We therefore explored the potential of using single time point samples to predict out- 
come. We chose the earliest sample obtained after the critical decision regarding escalation of 
treatment, i.e. the earliest sample obtained at the maximum treatment level (WHO grade 7), to 
generate an outcome predictor. The median time from sampling until the outcome was 39 
(IQR 16-64) days in our cohort. Using 57 proteins for which targeted mass spectrometric 
assays (MRM assays) as listed in the MRMAssayDB [32] are available, indicating that they 
have been selected for a clinical or biomedical indication also in other context, we established 
a machine learning model based on parenclitic networks, a graph-based approach in which 
networks representing the deviation of an individual from the population are derived [33,34]. 
The networks are generated by considering every pair of analytes (proteins) individually and 
calculating the respective edge weight as the estimated probability of fatal outcome based on 
this pair of proteins. Predictive models are then generated by considering the topological dif- 
ferences between networks from individual cases (non-survivors vs. survivors) (Methods). We 
achieved high prediction accuracy on the test subjects, who were excluded when training the 
machine learning model (in a cross-validation fashion, see Methods), with AUC = 0.81 (95% 
CI 0.68-0.94) for the receiver-operating characteristic (ROC) curve (Fig 2B). Out of the 25 
proteins with the highest relevance in the parenclitic model, 15 are components of the coagula- 
tion system and 8 proteins belong to the complement cascade (S2 Table). To further demon- 
strate that the proteomic data contains sufficient physiological information to allow outcome 
prediction, i.e. that the results are not restricted to a specific algorithm, we also tested a model 
based on a support vector machine (SVM). The SVM proved to be capable of survival predic- 
tion as well, albeit with inferior performance compared to the parenclitic network (S1 Fig). 

To independently validate the potential of the plasma proteome to predict outcomes in crit- 
ically ill COVID19 patients, we examined the performance of the parenclitic network trained 
on our prime cohort (Charité) on an independent cohort of 24 patients with critical COVID- 
19 from Austria (survival n = 19, death n = 5, median time between sampling and outcome 22 
days, interquartile range 15-42 days) (‘Innsbruck cohort, Methods). Despite the validation 
cohort originating from a different hospital and health care system, the machine learning 
model demonstrated high predictive power on this independent cohort (AUROC = 1.0, 

P = 0.000047, Fig 2C). Using the cutoff value for survival prediction derived from the Charité 
cohort, the model correctly predicted the outcome for 18 out of 19 patients who survived and 
for 5 out of 5 patients who died in this independent ‘Innsbruck’ cohort. 


Discussion 


The prognostic value of several biomarkers (e.g. CRP, IL-6, ferritin) and clinical scores for pre- 
dicting disease progression in COVID-19 at early disease stages, e.g. at hospital admission, is 
now well established [35,36]. For the comparatively homogeneous subgroup of severely ill 
patients already requiring mechanical ventilation and additional organ support, prediction of 
future disease trajectories and outcome (survival or death) is by far more challenging, and only 
limited data exist [17,37,38]. Moreover, clinical severity scores are often not validated for 
unconscious patients, and laboratory measurements are frequently confounded by intensive 
care treatment. Outcome of ICU patients may further be critically determined by resource 
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Fig 2. Prediction of survival or death in critically ill patients, from the first sampling time point at intensive care treatment 
level (WHO grade 7). a) Performance of established ICU risk assessment indices (APACHE II, SOFA and Charlson comorbidity 
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index) calculated at the time of ICU admission (APACHE II, Charlson comorbidity index) or at the first time point at WHO grade 
7 (SOFA score) in predicting the outcome in critically ill patients. b) Prediction of survival or death in critically ill patients using 
proteomics. A machine learning model based on parenclitic networks (Methods) was trained on the samples of the Charité cohort 
closest to the time point of treatment escalation during intensive care (start of ECMO, RRT or vasopressors, i.e. WHO grade 7). 
The performance was assessed on the test samples, which were held out during training. Upper panel: The ROC curve indicates 
correct classification of survival vs non-survival with an AUROC of 0.81 (95% CI 0.68-0.94). Middle panel: The proteomic 
classifier was used to predict the probability of survival and non-survival, which is significantly different between the groups. 
Lower panel: Kaplan-Meier survival curves using a threshold of predicted probability (0.678) chosen to maximize Youden’s J 
index (J = sensitivity + specificity—1). Log-rank test was used to compare survival rates between patients with predicted death 
risk < 0.678 (black) and > 0.678 (orange). c) (upper, middle, and lower panels): The model trained on the Charité cohort, was 
tested on an independent cohort (Innsbruck). d) Exemplary parenclitic networks from two patients in the independent Innsbruck 
cohort. Edges with weights > 0.5 are shown. Left panel: a network predicting low probability of death in a surviving patient. Right 
panel: a network predicting high probability of death in a non-survivor. 


https://doi.org/10.1371/journal.pdig.0000007.g002 


constraints, the varying level of experience with organ replacement therapies or the rates of 
superinfection, rendering prediction complex [38]. On the other hand, patients in intensive 
care units, and particularly those in need of special organ replacement therapies such as 
ECMO, require a disproportionately large share of resources compared to other patients, so 
decisions to initiate such therapies should be based on the best information and assessment 
possible. Prognostic tools in critically ill patients are hence of crucial importance to guide and 
tailor the treatment efforts. This is particularly true in a situation when health care systems are 
overstrained. Another key potential for the use of outcome predictors is clinical trial monitor- 
ing, where measurements of prognostic molecular signatures over time can be used to evaluate 
experimental therapies on an individual, time-resolved basis. Moreover, an accurate outcome 
predictor would allow us to test whether a given treatment changes the predicted trajectory of 
an individual patient. 

Previously, we and others investigated plasma proteome alterations in COVID-19 [6- 
8,10,12,14], which show a remarkable ability to classify the severity of disease. For instance, our 
investigations showed that the host response in the early inflammatory phase creates a strong 
signature in the plasma proteome, and is critical as well as predictive about the future disease 
progression in severe COVID-19 [14]. New proteomic platform technologies have significantly 
gained precision and throughput compared to their predecessors, rendering the application of 
multivariate regression models more effective and bringing them increasingly close to routine 
clinical use [6]. Importantly, even without platform technologies, biomarkers identified in 
proteomic profiles can be translated into clinical use, e.g. by using standard techniques such as 
selective reaction monitoring (SRM) for the quantification of protein panels, or enzyme linked 
immunosorbent assays (ELISA) for the sensitive quantification of individual biomarkers. 

Here, we show that an increase in specific inflammatory and acute phase proteins over time 
(e.g., SAA1;SAA2, CRP, ITIH3, LRG1, SERPINA1, and LBP) is associated with the risk of 
death from COVID-19, while an increase of kallikrein (KLKB1), kallistatin (SERPINA4), 
thrombin (F2), apolipoprotein C3 (APOC3), GPLD1, and the protease inhibitor A2M, is asso- 
ciated with survival. Interestingly, we and others have found all of these proteins to also be dif- 
ferentially expressed depending on disease severity in COVID-19 [6,7,10,12,14]. Moreover, 
there is substantial overlap with a panel of proteins predictive of mortality in COVID-19 iden- 
tified by Véllmy et al. [39]. Hence, despite only a subset of proteins that are differentially con- 
centrated depending on disease severity predict outcome, and the fact that typical single- 
centre ICU studies are conducted on small numbers of patients, this result indicates a high 
congruence and reproducibility of plasma proteome signatures across studies. 

SAA1;SAA2, CRP, ITIH3, SERPINA1 are acute phase proteins that are also dysregu- 
lated in other inflammatory states including sepsis [40]. Increased LRG1 and LBP as well 
as decreased A2M [40,41] are indicators of an ongoing immune response, complementing 
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the general pro-inflammatory signature of these predictive proteins. APOC3 and GPLD1 
are involved in lipid metabolism, which has been shown to be dysregulated in bacterial 
pneumonia, thereby associated with unfavorable outcomes [42]. Kallikrein is involved in 
the blood coagulation system, fibrinolysis, and the complement cascade, three systems 
known to be dysregulated in COVID-19 [43-45]. It mediates the cleavage of kininogen to 
bradykinin and des-Arg®-bradykinin, a potent vasoactive peptide which is counter-regu- 
lated by ACE2, the cell entry receptor for SARS-CoV-2. Since the loss of ACE2 in COVID- 
19 supposedly leads to an imbalance of bradykinins, inhibition of the kallikrein-kinin sys- 
tem has been discussed as a treatment strategy in COVID-19 [46-48]. This hypothesis is 
not supported by our data, which indicate improved prognosis with increasing kallikrein 
levels. Kallikrein is counterbalanced by kallistatin, which equally increased over time in 
survivors in our study population, thereby potentially equilibrating the increase in the 
kinin-kallikrein system. Kallistatin is known for pleiotropic effects in vascular repair, 
endothelial function, and inflammation [49] and possesses protective properties in acute 
lung injury. According to our data kallistatin should be considered as a potential candidate 
for clinical testing in critical COVID-19 [50]. 

While prognostic assessments based on repeated measurements over time allow for treat- 
ment monitoring, including evaluation of experimental therapies in clinical trials, prognostic 
measurements from single time points are particularly valuable for timely patient management 
and resource allocation. We therefore employed a machine learning model to integrate proteo- 
mic measurements from the first time point at WHO grade 7, i.e. invasive mechanical ventila- 
tion and additional organ support therapy, in order to derive prognosis of outcome. We 
achieve high prognostic values, both in the exploratory cohort, as well as in a fully independent 
cohort. 

The results are currently based on a comparatively small number of patients with adverse 
outcome. Given the naturally small sample sizes of ICU cohorts and the exploratory character 
of our study, findings will have to be validated in larger cohorts, before further steps can be 
undertaken to translate our findings into clinical practice in the future. The panel of proteins 
identified in our study should also be assessed for other conditions such as non-COVID-19 
ARDS. 

The majority of proteins with the highest relevance for the machine learning predictor were 
components of the coagulation system and the complement cascade (S2 Table). Both systems 
are known to be crucial for treatment and disease courses for severely ill COVID-19 patients 
[9,10]. This is particularly well illustrated by recent data from a multi-platform clinical trial 
indicating that a substantial proportion of patients with severe COVID-19 develop thrombo- 
embolic events despite therapeutic anticoagulation [51,52]. The protein with the highest rele- 
vance in our model is Fetuin-A (AHSG), which is known to be strongly downregulated in 
severe COVID-19 [10,14]. Of note, genetic polymorphisms associated with higher AHSG 
plasma concentrations were found to be protective in SARS-CoV-1 infection [53]. One impor- 
tant function of AHSG is regulation of inflammation through deactivation of macrophages 
[54], and there is emerging evidence that macrophages play a key role in pulmonary inflamma- 
tion and dysfunction in COVID-19 [11,55-57]. A number of proteins identified as outcome 
predictors have also been shown to be differentially expressed in sepsis, including SAA1, CRP, 
SERPINA1, KLKB1, and A2M [40], indicating a general inflammatory signature rather than 
specific markers of COVID-19. 

In summary, we have leveraged the power of the proteome to address a problematic diag- 
nostic gap in the prognosis of the most critical form of COVID-19, that is not covered by 
established clinical assessments, such as the SOFA or APACHE II scores. We show that the 
proteome accurately predicts survival in critically ill patients with COVID-19, from samples 





PLOS Digital Health | https://doi.org/10.1371/journal.pdig.0000007 January 18, 2022 9/17 


PLOS DIGITAL HEALTH 


A proteomic survival predictor for COVID-19 patients in intensive care 





that were collected 39 days in median before the outcome. The findings warrant further pro- 
spective assessment of proteomic predictors and the described models in larger cohorts. The 
majority of proteins with high relevance in the model are components of the coagulation sys- 
tem and complement cascade, highlighting their critical role in progression and outcome of 
most severe COVID-19. 


Methods 


Charité patient cohort and clinical data 


Patients included in this analysis are a sub-cohort of the PA-COVID-19 study conducted at 
Charité— Universitatsmedizin Berlin, a prospective observational cohort study on the patho- 
physiology of COVID-19 as described previously [14,19,29]. All patients with PCR-confirmed 
SARS-CoV-2 infection that progressed to critical disease (WHO grade 7, i.e. invasive mechani- 
cal ventilation and additional organ support), were eligible for inclusion. Exclusion criteria 
included refusal to provide informed consent by the patient or a legal representative, and any 
condition prohibiting serial biosampling. Patients were treated according to current clinical 
guidelines. Patients for whom limitation of therapy was decided according to the patient’s 
wish were excluded from analysis. This includes three cases, for whom limitation of therapy 
was decided at a later time point according to the patient’s presumed wish and predictably 
unfavorable outcome. All other patients received maximum intensive care treatment including 
organ replacement therapies at the discretion of the responsible physicians. One patient (ID 
135), who was still hospitalized and clinically improving 5 months after admission, was classi- 
fied as a survivor. One patient still in critical condition 5 months after admission was excluded 
due to uncertain outcome. 

Biosampling of EDTA plasma for proteome measurement was performed up to 3 times per 
week after inclusion. Disease severity was assessed according to the WHO ordinal scale for 
clinical improvement (World Health Organisation 2020). Clinical data were captured in secu- 
Trial (interActive Systems GmbH, Berlin, Germany). Pseudonymized data exported from 
secuTrial were processed using JMP Pro 15 (SAS Institute Inc., Cary, NC, USA). 


Innsbruck Patient cohort and clinical data 


Serum samples from patients admitted to the intensive care unit at the Department of Medicine, 
University Hospital of Innsbruck with PCR-confirmed severe COVID-19 were collected within 
the first days (median 7.5, IQR 5-12) after admission, and written informed consent was 
obtained. Patients were treated according to national guidelines. The study was approved by the 
local ethics research committee EK-Nr. 1107/2020, and EK-Nr. 1103/2020 for follow-up. 


Statistical analysis and multiple-testing correction 


Statistical testing on proteomic and diagnostic data was performed in the R environment 
for statistical computing, version 3.6.0 [58], as described previously [14]. Briefly, all protein 
measurements were first log2-transformed and only protein groups matched to at least 
three different peptides were considered. Quantities of gene products corresponding to 
open reading frames IGxx (i.e. different types of immunoglobulin chains) were summed 
together to generate quantities representative of the overall levels of immunoglobulin clas- 
ses (IGHVs, IGLVs, etc). Imputation of missing data was not performed. Significance test- 
ing for equal medians was performed using the Mann-Whitney U test, as implemented in 
the “wilcox.test” function of the “stats” R package. A non-parametric test was chosen here 
to minimise the influence of outliers on the calculated p-values. Multiple-testing correction 
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was performed using the Benjamini-Hochberg false discovery rate controlling procedure 
[59], implemented in the “p.adjust” function of the “stats” R package. Adjusted p-values 
below 0.05 were considered significant. 


Identifying omics trajectories that are predictive of survival at the peak 
period of the disease 


For each omics feature, the difference between its log2-levels at the last and the first sampling 
timepoints during the peak period of the disease was considered. This period was defined as 
the time when the patient was receiving the most intensive treatment during their stay in hos- 
pital, that is the time when the patient was at WHO grade 6 or 7. The distribution of this differ- 
ence between survivors and non-survivors was compared using the Mann-Whitney U test. 
Only non-DNI patients with known outcome were included. 


Prediction of survival 


The first time point measured at the WHO grade 7 was selected per patient, to train the sur- 
vival predictor. This ensured that ‘future’ information, encoded in the later time points, was 
not used for predictor training. To reduce the feature space used as input for the machine 
learning model, we limited it to the quantities of 57 proteins which are FDA-approved bio- 
markers with MRM assays available [32] and which were quantified with at least three different 
peptides in this study. Missing values were imputed using minimal value imputation, and the 
data were standardized. 

Machine learning was carried out using the parenclitic networks approach [33,34]. Briefly, 
during training, for each pair of features, a radial SVM classifier is trained (using the svm() 
function from the “e1071” R package with default settings). For each sample, a network is then 
built, wherein vertices correspond to features and the edge weight is the death probability as 
predicted by the SVM classifier. Maximum, mean and standard deviation of the edge weights, 
as well as the numbers of edges with weights greater than 0.5 (i.e. fatal outcome is predicted) 
and nodes with at least one such edge are calculated. A LASSO classification model 
(alpha = 0.01) is then constructed on these 5 features using the glmnet() function of the 
“glmnet” [60] R package with default settings. 

For the assessment of the classifier performance (Charité cohort), a cross-validation method 
was applied in the following way: the prediction was made for each sample by excluding (with- 
holding) it from the dataset along with two other samples (chosen randomly with the con- 
straint that out of 3 samples one corresponds to a non-survivor and two to survivors), training 
the classifier on the remaining (independent) samples and then generating predictions for the 
withheld samples using the trained model. Such a leave-3-out partition was generated ran- 
domly 50 times and the predictions for each sample were averaged. The partitioning strategy 
ensured that the evaluation of the predictive performance would not be affected by any poten- 
tial overfitting, no matter how significant. For the assessment of the performance on an inde- 
pendent dataset (Innsbruck cohort), the classifier was trained on all the Charité samples and 
used to estimate the probabilities of fatal outcome on the Innsbruck cohort. The source code is 
provided in supplementary materials. 

The ‘relevance’ scores for proteins in the parenclitic model were calculated as Kleinberg’s 
authority centrality scores for the respective vertices in the “generalizing network”. This net- 
work was generated by (i) replacing edge weights greater than 0.5 with 1.0 and weights less 
than 0.5 with 0.0 in the networks corresponding to non-survivors and (ii) averaging the result- 
ing networks. 
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For survival prediction using support vector machines (SVM), the same data and the same 
selection of proteins as for the parenclitic network model was applied. The SVM was built in 
Python 3.8.5 using the SVC() function with an rbf-kernel and a gamma value of 0.005 as 
implemented in scikit-learn 0.23.2 [61]. To circumvent class-imbalances, balanced class- 
weights were assumed. For benchmarking the model a stratified 10-fold cross-validation was 
performed. The data were scaled to zero mean and unit variance based on the training data. 
The reported results for the Charité-cohort are based on the data that were withheld when con- 
structing the model in each cross-validation step. For validating the model, a model was 
trained on all samples of the Charité-cohort and validated using the independent Innsbruck- 
cohort. p-Values were calculated using the Mann-Whitney U test as implemented in SciPy 
1.5.2 [62]. AUC values and confidence intervals were obtained using the roc() function of the 
pROC R package. 

We followed the guidelines for transparent reporting of multivariable prediction models for 
individual prognosis or diagnosis (TRIPOD) as proposed by the EQUATOR network [63]. 


Study approval 


The study was approved by the ethics committee of Charité—Universitatsmedizin Berlin 
(EA2/066/20) and conducted in accordance with the Declaration of Helsinki and guidelines of 
Good Clinical Practice (ICH 1996). Written informed consent was obtained from all patients 
or legal representatives according to regulations set by the ethics committee of Charite—Uni- 
versitatsmedizin Berlin. The study is registered in the German and the WHO international 
registry for clinical studies (DRKS00021688). 
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$1 Tripod Checklist. Transparent reporting of multivariable prediction models for indi- 
vidual prognosis or diagnosis (TRIPOD) checklist as proposed by the EQUATOR network 
[63]. Checklist includes location of key aspects within the manuscript. 
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S1 Table. Baseline, treatment, and outcome characteristics of patient cohort with severe 
COVID-19 receiving maximum therapy at Charité—University hospital Berlin. 
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$2 Table. Top 25 proteins included in the machine learning model, ordered by their esti- 
mated ‘relevance’ scores (Methods). Red writing indicates proteins involved in the comple- 
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$1 Fig. Performance of an SVM model in predicting survival for critical (WHO grade 7) 
COVID-19 patients. Left panel: Boxplot of the decision function of the SVM for the Charité 
cohort. Displayed is the performance on the test data that were not used for model training. 
Middle panel: Boxplot of the decision function of the SVM for the Innsbruck cohort using a 
pre-trained model based on the Charité-cohort. Right panel: ROC-Curve and AUC corre- 
sponding to the boxplots for the Charité-cohort (black) and for the Innsbruck-cohort (red). 
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